Amendments to the Claims 
Please cancel Claims 41 and 42 without prejudice to or disclaimer of the subject 
matter contained therein. 

Please amend Claims 19, 20, 23, 39, and 40 and add new Claims 43-58 as 

follows. 

1 . (Original) An apparatus for detecting the presence of speech within an input 
audio signal, comprising: 

a memory for storing a predetermined function which gives, for a given set of 
audio signal values, a probability density for parameters of a predetermined speech model which 
is assumed to have generated the set of audio signal values, the probability density defining, for a 
given set of model parameter values, the probability that the predetermined speech model has 
those parameter values, given that the speech model is assumed to have generated the set of 
audio signal values; 

means for receiving a set of audio signal values representative of an input audio 

signal; 

means for applying the set of received audio signal values to said stored function 
to give the probability density for said model parameters for the set of received audio signal 
values; 

means for processing said function with said set of received audio signal values 
applied to obtain values of said parameters that are representative of said input audio signal; and 
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means for detecting the presence of speech using said obtained parameter values. 

2. (Original) An apparatus according to claim 1, wherein said processing means 
comprises means for drawing samples from said probability density function and means for 
determining said values of said parameters that are representative of the speech from said drawn 
samples. 

3. (Original) An apparatus according to claim 2, wherein said drawing means is 
operable to draw samples iteratively from said probability density function. 

4. (Original) An apparatus according to claim 2, wherein said processing means 
comprises a Gibbs sampler. 

5. (Original) An apparatus according to claim 2, wherein said processing means 
is operable to determine a histogram of said drawn samples and wherein said values of said 
parameters are determined from said histogram. 



6. (Original) An apparatus according to claim 5, wherein said processing means 
is operable to determine said values of said parameters using a weighted sum of said drawn 
samples, and wherein the weighting is determined from said histogram. 



7. (Original) An apparatus according to claim 1, wherein said receiving means is 
operable to receive a sequence of sets of signal values representative of an input audio signal and 
wherein said applying means, processing means and detecting means are operable to perform 
their function with respect to each set of received audio signal values in order to determine 
whether or not each set of received signal values corresponds to speech. 

8. (Original) An apparatus according to claim 7, wherein said processing means 
is operable to use the values of parameters obtained during the processing of a preceding set of 
signal values as initial estimates for the values of the corresponding parameters of a current set of 
signal values being processed. 

9. (Original) An apparatus according to claim 7, wherein said sets of signal 
values in said sequence are non-overlapping. 

10. (Original) An apparatus according to claim 1, wherein said speech model 
comprises an auto-regressive process model, wherein said parameters include auto-regressive 
model coefficients and wherein said detecting means is operable to compare the value of at least 
one of said auto-regressive model coefficients with a prestored threshold value. 

11. (Original) An apparatus according to claim 10, wherein said detecting means 
is operable to compare the values of a plurality of said auto-regressive model coefficients with a 
corresponding plurality of predetermined values. 



12. (Original) An apparatus according to claim 1, wherein said processing means 
is operable to vary the number of parameters used to represent the speech within the audio signal 
values and wherein said detecting means is operable to compare the number of parameters used 
to represent speech within the audio signal values with a predetermined threshold value, in order 
to detect the presence of speech within said audio signal. 

13. (Original) An apparatus according to claim 1, wherein received speech signal 
values are representative of a speech signal generated by a speech source as distorted by a 
transmission channel between the speech source and the receiving means; wherein said 
predetermined function includes a first part having first parameters which models said source and 
a second part having second parameters which models said channel; wherein said processing 
means is operable to obtain parameter values of at least said first parameters; and wherein said 
detecting means is operable to detect the presence of speech within said input audio signal from 
the obtained values of said first parameters. 

14. (Original) An apparatus according to claim 13, wherein said function is in 
terms of a set of raw speech signal values representative of speech generated by said source 
before being distorted by said transmission channel, wherein the apparatus further comprises 
second processing means for processing the received set of signal values with initial estimates of 
said first and second parameters, to generate an estimate of the raw speech signal values 
corresponding to the received set of audio signal values and wherein said applying means is 



operable to apply said estimated set of raw speech signal values to said function in addition to 
said set of received signal values. 

15. (Original) An apparatus according to claim 14, wherein said second 
processing means comprises a simulation smoother. 

16. (Original) An apparatus according to claim 14, wherein said second 
processing means comprises a Kalman filter. 

17. (Original) An apparatus according to claim 13, wherein said second part is a 
moving average model and wherein said second parameters comprise moving average model 
coefficients. 

18. (Original) An apparatus according to claim 1, further comprising means for 
evaluating said probability density function for the set of received audio signal values using one 
or more derived samples of parameter values for different numbers of parameter values, to 
determine respective probabilities that the predetermined speech model has those parameter 
values and wherein said processing means is operable to process at least some of said derived 
samples of parameter values and said evaluated probabilities to determine said values of said 
parameters that are representative of the audio speech signal. 

19. (Currently Amended) A speech recognition system comprising: 
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m e ans for receiving an input signal r e pres e ntative of an audio signal; 

an apparatus according to claim 1 for detecting the presence of speech within the 
an input signal; and 

recognition processing means for performing a recognition processing of the 
portion of the input signal corresponding to speech. 

20. (Currently Amended) A speech processing system comprising: 
m e ans fo r rec e iving an input audio signal; 

an apparatus according to claim 1 for detecting the presence of speech within the 
an input audio signal; and 

means for processing the portion of the input audio signal corresponding to 

speech. 

21. (Original) A method of detecting the presence of speech within an input 
audio signal, comprising: 

storing a predetermined function which gives, for a given set of audio signal 
values, a probability density for parameters of a predetermined speech model which is assumed 
to have generated the set of audio signal values, the probability density defining, for a given set 
of model parameter values, the probability that the predetermined speech model has those 
parameter values, given that the speech model is assumed to have generated the set of audio 
signal values; 
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receiving a set of audio signal values representative of an input audio signal at a 

receiver; 

applying the set of received audio signal values to said stored function to give the 
probability density for said model parameters for the set of received audio signal values; 

processing said function with said set of received audio signal values applied to 
obtain values of said parameters that are representative of said input audio signal; and 

detecting the presence of speech using said obtained parameter values. 



22. (Original) A method according to claim 21 , wherein said processing step 
comprises the steps of drawing samples from said probability density function and determining 
said values of said parameters that are representative of the speech from said drawn samples. 



23. (Currently Amended) A method according to claim 22, wherein said drawing 
step d r aw draws samples iteratively from said probability density function. 

24. (Original) A method according to claim 22, wherein said processing step 
uses a Gibbs sampler. 



25. (Original) A method according to claim 22, wherein said processing step 
determines a histogram of said drawn samples and wherein said values of said parameters are 
determined from said histogram. 



26. (Original) A method according to claim 25, wherein said processing step 
determines said values of said parameters using a weighted sum of said drawn samples, and 
wherein the weighting is determined from said histogram. 

27. (Original) A method according to claim 21, wherein said receiving step 
receives a sequence of sets of signal values representative of an input audio signal and wherein 
said applying step, processing step and detecting step are performed on each set of received audio 
signal values in order to determine whether or not each set of received signal values corresponds 
to speech. 

28. (Original) A method according to claim 27, wherein said processing step 
uses the values of parameters obtained during the processing of a preceding set of signal values 
as initial estimates for the values of the corresponding parameters of a current set of signal values 
being processed. 

29. (Original) A method according to claim 27, wherein said sets of signal values 
in said sequence are non-overlapping. 

30. (Original) A method according to claim 21, wherein said speech model 
comprises an auto-regressive process model, wherein said parameters include auto-regressive 
model coefficients and wherein said detecting step compares the value of at least one of said 
auto-regressive model coefficients with a pre-stored threshold value. 
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3 1 . (Original) A method according to claim 30, wherein said detecting step 
compares the values of a plurality of said auto-regressive model coefficients with a 
corresponding plurality of predetermined values. 

32. (Original) A method according to claim 21, wherein said processing step 
varies the number of parameters used to represent the speech within the audio signal values and 
wherein said detecting step compares the number of parameters used to represent speech within 
the audio signal values with a predetermined threshold value, in order to detect the presence of 
speech within said audio signal. 

33. (Original) A method according to claim 21, wherein received speech signal 
values are representative of a speech signal generated by a speech source as distorted by a 
transmission channel between the speech source and the receiver; wherein said predetermined 
function includes a first part having first parameters which models said source and a second part 
having second parameters which models said channel; wherein said processing step obtains 
parameter values of at least said first parameters; and wherein said detecting step detects the 
presence of speech within said input audio signal from the obtained values of said first 
parameters. 

34. (Original) A method according to claim 33, wherein said function is in terms 
of a set of raw speech signal values representative of speech generated by said source before 
being distorted by said transmission channel, wherein the apparatus further comprises a second 



processing step of processing the received set of signal values with initial estimates of said first 
and second parameters, to generate an estimate of the raw speech signal values corresponding to 
the received set of audio signal values and wherein said applying step applies said estimated set 
of raw speech signal values to said function in addition to said set of received signal values. 

35. (Original) A method according to claim 34, wherein said second processing 
step uses a simulation smoother. 

36. (Original) A method according to claim 34, wherein said second processing 
step uses a Kalman filter. 

37. (Original) A method according to claim 33, wherein said second part is a 
moving average model and wherein said second parameters comprise moving average model 
coefficients. 

38. (Original) A method according to claim 21, further comprising the step of 
evaluating said probability density function for the set of received audio signal values using one 
or more derived samples of parameter values for different numbers of parameter values, to 
determine respective probabilities that the predetermined speech model has those parameter 
values and wherein said processing step processes at least some of said derived samples of 
parameter values and said evaluated probabilities to determine said value of said parameters that 
are representative of the audio speech signal. 
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39. (Currently Amended) A speech recognition method comprising the ste p s of : 
r e c e iving an input signal r e presentative of an audio signal; 
a method according to claim 21 for detecting the presence of speech within the an 
input signal; and 

performing a recognition processing of the portion of the input signal 
corresponding to speech. 



40. (Currently Amended) A speech processing method comprising th e st ep s of : 
r e ceiving an input audio signal; 

a method according to claim 21 for detecting the presence of speech within the an 
input audio signal; and 

processing the portion of the input audio signal corresponding to speech. 



41-42. (Cancelled) 



43. (New) An apparatus for detecting the presence of speech within an input 
audio signal, comprising: 

a memory operable to store a predetermined function which gives, for a given set 
of audio signal values, a probability density for parameters of a predetermined speech model 
which is assumed to have generated the set of audio signal values, the probability density 
defining, for a given set of model parameter values, the probability that the predetermined speech 
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model has those parameter values, given that the speech model is assumed to have generated the 
set of audio signal values; 

a receiver operable to receive a set of audio signal values representative of an 
input audio signal; 

an applicator operable to apply the set of received audio signal values to said 
stored function to give the probability density for said model parameters for the set of received 
audio signal values; 

a processor operable to process said function with said set of received audio signal 
values applied to obtain values of said parameters that are representative of said input audio 
signal; and 

a detector operable to detect the presence of speech using said obtained parameter 

values. 

44. (New) An apparatus according to claim 43, wherein said processor comprises 
a sampler operable to draw samples from said probability density function and a determiner 
operable to determine said values of said parameters that are representative of the speech from 
said drawn samples. 

45. (New) An apparatus according to claim 44, wherein said processor comprises 
a Gibbs sampler. 
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46. (New) An apparatus according to claim 44, wherein said processor is operable 
to determine a histogram of said drawn samples and wherein said values of said parameters are 
determined from said histogram. 

47. (New) An apparatus according to claim 46, wherein said processor is operable 
to determine said values of said parameters using a weighted sum of said drawn samples, and 
wherein the weighting is determined from said histogram. 

48. (New) An apparatus according to claim 43, wherein said receiver is operable 
to receive a sequence of sets of signal values representative of an input audio signal and wherein 
said applicator, processor and detector are operable to perform their function with respect to each 
set of received audio signal values in order to determine whether or not each set of received 
signal values corresponds to speech. 

49. (New) An apparatus according to claim 48, wherein said processor is operable 
to use the values of parameters obtained during the processing of a preceding set of signal values 
as initial estimates for the values of the corresponding parameters of a current set of signal values 
being processed. 

50. (New) An apparatus according to claim 43, wherein said speech model 
comprises an auto-regressive process model, wherein said parameters include auto-regressive 
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model coefficients and wherein said detector is operable to compare the value of at least one of 
said auto-regressive model coefficients with a prestored threshold value. 

5 1 . (New) An apparatus according to claim 43, wherein said processor is operable 
to vary the number of parameters used to represent the speech within the audio signal values and 
wherein said detector is operable to compare the number of parameters used to represent speech 
within the audio signal values with a predetermined threshold value, in order to detect the 
presence of speech within said audio signal. 

52. (New) An apparatus according to claim 43, wherein received audio signal 
values are representative of a speech signal generated by a speech source as distorted by a 
transmission channel between the speech source and the receiver, wherein said predetermined 
function includes a first part having first parameters which models said source and a second part 
having second parameters which models said channel, wherein said processor is operable to 
obtain parameter values of at least said first parameters, and wherein said detector is operable to 
detect the presence of speech within said input audio signal from the obtained values of said first 
parameters. 

53. (New) An apparatus according to claim 43, wherein said function is in terms 
of a set of raw speech signal values representative of speech generated by said source before 
being distorted by said transmission channel, wherein the apparatus further comprises a second 
processor operable to process the received set of signal values with initial estimates of said first 
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and second parameters, to generate an estimate of the raw speech signal values corresponding to 
the received set of audio signal values and wherein said applicator is operable to apply said 
estimated set of raw speech signal values to said function in addition to said set of received 
signal values. 

54. (New) An apparatus according to claim 43, further comprising an evaluator 
operable to evaluate said probability density function for the set of received audio signal values 
using one or more derived samples of parameter values for different numbers of parameter 
values, to determine respective probabilities that the predetermined speech model has those 
parameter values and wherein said processor is operable to process at least some of said derived 
samples of parameter values and said evaluated probabilities to determine said values of said 
parameters that are representative of the audio speech signal. 

55. (New) A speech recognition system comprising: 

a receiver operable to receive an input signal representative of an audio signal; 

a memory operable to store a predetermined function which gives, for a given set 
of audio signal values, a probability density for parameters of a predetermined speech model 
which is assumed to have generated the set of audio signal values, the probability density 
defining, for a given set of model parameter values, the probability that the predetermined speech 
model has those parameter values, given that the speech model is assumed to have generated the 
set of audio signal values; 
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an applicator operable to apply a set of audio signal values representative of the 
input signal to said stored function to give the probability density for said model parameters for 
the set of audio signal values; 

a processor operable to process said function with said set of received audio signal 
values applied to obtain values of said parameters that are representative of said input audio 
signal; 

a detector operable to detect the presence of speech using said obtained parameter 

values; and 

recognition processor operable to perform a recognition processing of the portion 
of the input signal corresponding to speech. 

56. (New) A speech processing system comprising: 
a receiver operable to receive an input audio signal; 

a memory operable to store a predetermined function which gives, for a given set 
of audio signal values, a probability density for parameters of a predetermined speech model 
which is assumed to have generated the set of audio signal values, the probability density 
defining, for a given set of model parameter values, the probability that the predetermined speech 
model has those parameter values, given that the speech model is assumed to have generated the 
set of audio signal values; 

an applicator operable to apply a set of audio signal values representative of the 
input signal to said stored function to give the probability density for said model parameters for 
the set of audio signal values; 
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a first processor operable to process said function with said set of received audio 
signal values applied to obtain values of said parameters that are representative of said input 
audio signal; 

a detector operable to detect the presence of speech using said obtained parameter 

values; and 

a second processor operable to process the portion of the input audio signal 
corresponding to speech. 

57. (New) A computer readable medium storing computer executable instructions 
for causing a programmable computer deice to carry out a method of detecting the presence of 
speech within an input audio signal, the instructions comprising instructions for: 

storing a predetermined function which gives, for a given set of audio signal 
values, a probability density for parameters of a predetermined speech model which is assumed 
to have generated the set of audio signal values, the probability density defining, for a given set 
of model parameter values, the probability that the predetermined speech model has those 
parameter values, given that the speech model is assumed to have generated the set of audio 
signal values; 

receiving a set of audio signal values representative of an input audio signal at a 

receiver; 

applying the set of received audio signal values to said stored function to give the 
probability density for said model parameters for the set of received audio signal values; 
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processing said function with said set of received audio signal values applied to 
obtain values of said parameters that are representative of said input audio signal; and 
detecting the presence of speech using said obtained parameter values. 

58. (New) Computer executable instructions for causing a programmable 
computer device to carry out a method of detecting the presence of speech within an input audio 
signal, the instructions comprising instructions for: 

storing a predetermined function which gives, for a given set of audio signal 
values, a probability density for parameters of a predetermined speech model which is assumed 
to have generated the set of audio signal values, the probability density defining, for a given set 
of model parameter values, the probability that the predetermined speech model has those 
parameter values, given that the speech model is assumed to have generated the set of audio 
signal values; 

receiving a set of audio signal values representative of an input audio at a 

receiver; 

applying the set of received audio signal values to said stored function to give the 
probability density for said model parameters for the set of received audio signal values; 

processing said function with said set of received audio signal values applied to 
obtain values of said parameters that are representative of said input audio signal; and 

detecting the presence of speech using said obtained parameter values. 
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